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Abstract 

We calculate the probability distribution of repetitions of ancestors in a genealogical 
tree for simple neutral models of a closed population with sexual reproduction and 
non-overlapping generations. Each ancestor at generation g in the past has a weight 
w which is (up to a normalization) the number of times this ancestor appears in 
the genealogical tree of an individual at present. The distribution P g (w) of these 
weights reaches a stationary shape P OCl (w), for large g, i.e. for a large number of 
generations back in the past. For small w, Poo(w) is a power law (P 00 (w) ~ w@), 
with a non-trivial exponent (3 which can be computed exactly using a standard 
procedure of the renormalization group approach. Some extensions of the model are 
discussed and the effect of these variants on the shape of P QO (w) are analysed. 
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1 Introduction 



Non-trivial power laws are known to characterize second order phase transi- 
tions. A great success of the theory of critical phenomena has been to develop 
methods allowing to predict these power laws [1]. One of the most successful 
approaches used in the theory of critical phenomena is the renormalization 
group, which consists in trying to relate physical properties of a given sys- 
tem at different values of the external parameters (like the temperature or 
the magnetic field). In the last three or four decades, other non-trivial power 
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laws [2] have been found in all kinds of systems: Transition to chaos by period 
doubling [3], geometrical problems like self avoiding walks (which model poly- 
mers) and random walks [4], sand pile models and several other self organised 
critical systems [5], coarsening [6], etc. In many cases, renormalization ideas 
could be extended to predict the exponents of these power laws. 

In this work, we report recent results on simple models of genealogical trees 
[7]. When one looks at the distribution of repetitions in a genealogical tree (in 
the framework of the simple models defined below), one observes non trivial 
power laws. The exponents of these power laws can be calculated exactly by 
writing a relation on the generating function of the weights of the ancestors 
(a quantity proportional to the number of times they appear in a genealogical 
tree) which has the form of a simple renormalization transformation. Beyond 
the intrinsic interest of these models to describe real genealogies, they consti- 
tute simple pedagogical examples for which renormalization ideas allow the 
exact prediction of non trivial exponents. 



2 Neutral models of genealogical trees 

2. 1 The random parent model 

Let us first consider a simple neutral model of a closed population with sexual 
reproduction. By definition of the model, the population size at generation g 
in the past is N g and each individual at generation g has two parents cho- 
sen at random among the N g+ i individuals in the previous generation g + 1. 
Here g counts the number of past generations and so increases as one climbs 
up a genealogical tree. For simplicity we will consider either a population of 
constant size (N g = N) or a population size increasing exponentially with an 
average number p/2 of offsprings per couple, i.e. N g = (^j N as g counts the 
number of past generations; iVo is the size of the population at present, while 
the constant size case corresponds to p — 2. 

A related model was introduced to study the genetic similarity between indi- 
viduals in a population evolving under sexual reproduction [8], although there 
the two parents were distinct. We do not exclude this case here. 

Clearly, the number of branches of the genealogical tree of any individual 
increases like 2 9 and, as soon as the number of branches exceeds N g , there 
should be repetitions in this tree. Let us denote by r*f\g) the number of 
times that an individual i living at generation g in the past appears in the 
genealogical tree of individual a. At generation g — 0, the only individual in 
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the tree of a is a itself, therefore 

rl a) (0) = 6 ha (1) 

and the evolution of these repetitions satisfies the recursion 

rt\g + l)= E rf\g). (2) 

j children of i 

The quantity we want to consider is the probability H(r, g) that an individual 
living at generation g in the past appears r times in the genealogical tree of 
individual a (living at generation 0). Normalization implies 

£tf(r,<?) = l, (3) 

r>0 

the initial condition (1) gives 

ff M ) = i-* r , 1+ (l-i-)* r , , (4) 

and the fact that each individual has two parents at the previous generation 
gives 

Y,rH(r,g) = ^- (5) 

These probabilities H(r, g) can be measured by simulating small systems 
through a Monte Carlo procedure: For each individual of a population at 
generation g, two parents are chosen at random among the N g+ i individuals 
at generation g + 1. Figure 1 shows the results of such simulations for two 
populations of constant sizes, N g = N for several values of g with No = 1000 
in fig. la and N = 10000 in fig. lb. 

We see that for small g there are very few repetitions and H(r, g) decreases 
very fast with r. On the other hand, when g increases, the shape of if (r, g) 
becomes independent of g and of the population size N, with a clear power 
law at small r and a fast decay at large r. Figure 2 shows the distribution 
H(r, g) for several values of g and a population which increases exponentially 
with time, N g = 3 10 ~ g 2 9 . Here, again, the shape becomes stationary in the 
interval where g is large enough and N g is still large. This stationary shape is 
different from the one seen in fig. 1. 
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Fig. 1. Probability distribution H(r,g) of r repetitions after g generations (H(0,g) 
is not shown) at g = 5, 9, 12, 14, 16, 18, and 20 for a population of constant size. In 
figure la, N = 1000 and in figure lb, N = 10000. Both figures show averages over 
1000 samples. 

The shape of H(r, g) becomes stationary for large N g and large g in the sense 
that one gets a fixed distribution by an appropriate rescaling. In fact, intro- 
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Fig. 2. Probability distribution H(r, g) for a population size increasing by a factor 
3/2 at each generation. Here N g = 3 W ~ 9 2 9 , and averages over 5000 samples are 
performed. The generations shown are g = 8, 10, 11, 12, 13, 14, and 15. 

ducing the rescaled quantities w and P g (w) 



N 9 

w = — - r 

29 



23 



P g ( w ) = w H(r,g), 



(6) 
(7) 



where w can be considered as a continuous variable for N g 2 9 , (3,5) trans- 
form into 



J P g (w)dw = J wP g (w)dw 



(8) 



and we expect P g (w) to become a fixed distribution P 00 (w). This means that 
if we associate to each individual i in the tree of a at generation g in the past 
a weight defined by 



(a) (a) 



(9) 



the distribution of these weights becomes stationary in the scaling limit. 
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From (2,9) it is clear that these weights satisfy 



1) 



2N, 



E 



(10) 



9 j children of i 



As we limit ourselves to the case of a population increasing exponentially at 
rate p/2 per generation (so that N g = (-) N ), (10) reduces to 




The ratios wf'(g)/N g can be interpreted as the probability of reaching indi- 
vidual % by randomly climbing up the genealogical tree of a. In the particular 
case of a population of constant size (p = 2), the factor 1/2 in (11) is easy 
to understand. For a population of increasing size (p > 2), there is a factor 
\jp in (11) instead of 1/2 because of the factor N g in the definition (9) of the 
weights w\°^ . 

The key observation which allows one to calculate the distribution P g {w) in 
the scaling limit (large g and large N g ) is that, for large N g and for large g, the 
random variables which appear in the r.h.s. of (11) become independent. 
This is due to the fact that (at least in the model we consider) the weights 
Wj a \g) (of brothers and sisters) in the r.h.s. of (11) are uncorrelated. This 
independence, which is discussed in the appendix, will be the basis of the 
calculation of the fixed distribution Poo(w) in the following sections. 

2.2 Variants of the model 

One can consider some variants of the model defined above, for instance: 

• At each generation one could form fixed couples by making random pairs 
and assign to each individual at generation g one of these pairs (of parents) 
chosen at random at the previous generation (g + 1). In this case the cor- 
relations between the weights w g would again be small in the scaling limit 
and they can be ignored in the r.h.s. of (11). 

• One can also consider an imaginary situation where each individual has 
p' 7^ 2 parents (instead of p' — 2). In this case, the definition of the weights 
(9) should be replaced by 




wl a \g+l) 



- E w T\9). 



(ii) 



j children of i 




(P') 9 



(12) 
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to keep P g (w) normalized as in (8). For a population of constant size N g = 
N, the evolution of the weights (11) becomes 

^ a) (9 + l) = ^ E ™f(9)- (13) 

" j children of i 

As shown in the appendix, in the scaling limit, the correlations on the r.h.s. 
of (13) can be neglected in this case too. 

In the remaining of this work, we try to predict the stationary shape P^w). 



3 Generating function 



The fact that the weights in the r.h.s. of (11) are uncorrelated greatly simplifies 
the problem. One can then consider that w\ a \g+l) is the sum of k independent 
identically distributed random variables Wj a \g), where k is itself random. The 
probability of k is clearly 




which for large N g becomes (using the fact that N g+i = 2N g /p) a Poisson 
distribution 

Qk = £e-* • (14) 



Therefore for large N g , the number k of terms (k is the number of children of 
i) in the r.h.s. of (11) is randomly distributed according to (14) and these k 
terms are uncorrelated. This becomes a problem of branching processes [9] . If 
one introduces the generating function Q(X,g) 



Q(\,g) = (exp[\ w^\g)]) 



(15) 



and uses (11) and the fact that the weights are independent, one finds that 
Q{\g) satisfies 



Q(\g + 1) = E^ Q \ ~>9 J = ex P 



-p + pQ\-,9 
\p , 



(16) 



The normalization (9) of the w\°^ (g) implies that we have for all g 

Q(0,g)=Q\0,g) = l. (17) 



7 



Recursions similar to (16) appear in the theory of branching processes, in par- 
ticular in the Galton- Watson process, already introduced in the 19th century 
to study the problem of the extinction of families [9]. 

From (15,16), one can easily obtain recursions for the moments of the weigths 

(a) 



(w(g + l)) = (w(g)) = l (18) 

(w 2 (g + l))= 1 -(w 2 (g)) + l (19) 

(w 3 (g + 1)) = \(w 3 (g)) + -(w 2 (g)) + 1 (20) 

(w\g + l)) = ^(g)) + 4> 3 (<?)> + jM(9)) 2 + Vg?)> + i (21) 



and so on. We see that for large g, each moment of w ( f l \g) has a limiting 
value, as expected from the observation in the previous section that P g (w) 
converges to a fixed distribution Pqo(w) such that 



Q(A,oo) = J e Xw PooH dw. 



(22) 



The limiting values of these moments 



(w 2 (oo)) = 
(w 3 (oo)) = 
(w\oo)) 



p 



(p-1) 

p 2 (p + 2) 

(p_l)(p2_l) 

p 3 (p 3 + 5p 2 + Qp + 6) 



(p_l)(p2_l)(pS_l) 

etc., can be obtained directly by expanding the solution Q(X, oo) of 



Q(X, oo) = exp 



-p + p Q ( -, oo 
\P j 



(23) 
(24) 
(25) 



(26) 



around A = (choosing as normalization Q'(X, oo) = 1), 



Q(X, oo) = 1 + A + 



P 



-X 2 + 



p 2 {p + 2) 



2(p-l) Q(p - l)(p 2 - 1) 



A 3 



+ ^ + + . A 4 + 0(A5) . (27) 



24(p- l)(p 2 - l)(p 3 - 1)' 
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Several other properties of Q(X, oo) can be obtained from the fixed point 
equation (26) or from the recursion (16). The simplest one is the limit 



S= lim Q(A,oo), (28) 



where S is the solution (S ^ 1) of 

S = e' p+pS (29) 

This limiting value (S = 0.20318787... for a population of constant size, 
i.e. p = 2) is the coefficient of 5(w) in P^w) and so is the fraction of the 
population whose descendants become extinct: There is a fraction e~ p of the 
population with no children, a fraction e~ p+pe P — e~ p of the population with 
children but no grandchildren, and so on, and the sum of all these contributions 
gives S. 

Equations (16,26) have the form of a real space renormalization [10]. As a 
consequence, one can predict that for A — > — oo, Q(X, oo) approaches its limit 
as a power law, 

Q(A,oo)-S~ |A|-^ _1 , (30) 
where the exponent (5 must be 

for the terms of order |A| _/3_1 on both sides of (26) to be equal. For p — 2, 
this gives (3 = 0.2991138 . . . and (22) implies that at small w, the distribution 
Poo(w) is a power law 

PooH ~ w 13 (32) 

with j3 given by (31), in agreement with the results of the simulations shown 
in figures 1 and 2. 

In fact, for A — > — oo, the leading contribution in the difference Q(X, oo) — S 
consistent with (26) is 



Q(\,oo)-s*\\\-e-*F p (^ 



(33) 
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Fig. 3. The product |A| -/3-1 [Q(A, oo) - S 1 ] versus ln(-A)/lnp for p = 5,7,10. We 
see clearly the periodic nature of the amplitude predicted by (33). Discrepancies at 
small —A are due to the fact that the asymptotic regime is not yet reached. At too 
large —A, rounding errors in the difference Q(X, oo) — S make the resulst noisy and 
unreliable. 

where F p (z) is an arbitrary periodic function (not necessarily constant) of 
period 1 (i.e. F p (z + 1) = F p (z)). Such periodic amplitudes are often present 
in the critical behavior of systems which have a discrete scale invariance [11]. 
It is easy to calculate numerically the function Q(X, oo) for all values of A 
from the fixed point equation (26) which relates A to points X/p n arbitrarily 
close to 0, where the linear approximation Q(X, oo) ~ 1 + A = O(A) 2 becomes 
excellent. Using this procedure, we could determine (figure 3) the combination 
[<3(A,oo) — S']|A|~ /3 ~ 1 and the non constant periodic nature of the amplitude 
F p (z) is visible if p is large enough. The analytic determination of F p (z) is in 
principle possible [12,13] for p close to 1, but remains difficult for arbitrary p. 

The knowledge of the periodic function F p (z) determines in principle the whole 
expansion of Q(X, oo) in the limit A — > — oo. If we look for a solution of (26) 
which starts as (33) as A — > — oo, one finds by equating the two sides of (26) 
order by order in powers of |A| _/3_1 , 



Q(A,oo) = S + 



F 



+ 



p 



2(pS-l) 



'f r^v 

IAP+ 1 



10 



P 2 (pS + 2) 



) 



3 



6( P S - l)((pS) 2 - 1) |A|^ +1 



+ ... (34) 



In addition to the moments (23-25) of P 0O (w) (which are given by the expan- 
sion (27) of Q(A, oo)) and the exact values (29,31) of S and f3, let us just 
mention two properties of the solution of (26) which we checked by rather 
involved ways, and that we prefer to leave as conjectures: 

• Q(X, oo) is analytic in the whole complex plane of A 

• Q(\, oo) grows extremely fast (faster than the exponential of the expo- 
nential ... of the exponential of A) as A — > oo. As a consequence, for large 
w, Poo(w) decays faster than any exponential but slower than any stretched 
exponential (of exponent larger than 1) and even 



All the discussions of the present section can be repeated in the case of having 
p' parents. If we limit ourselves to a population of constant size (as we did to 
obtain (13)), we find that Q(X, oo) satisfies the same fixed point equation (26) 
as above with p replaced by p' 



This means that the distribution of the weights w is exactly the same for the 
cases of (i) 2 parents and a population size increasing exponentially by a factor 
p/2 at each generation and (ii) a population of constant size with p parents 
per individual. This can be checked by comparing figure 2 and figure 4, where 
we show the distributions H(r,g) for a population of constant size N = 1000 
and iV = 10000 with 3 parents per individual. 



4 Perturbation theories 

Despite its simplicity, it is not easy to extract more information on the function 
Q(X,oo) and consequently on the distribution Poo(u>) from the fixed point 
equation (26). There are however two limiting cases around which one can 
apply a perturbation theory and extract a few more properties of the fixed 
distribution: p close to 1 and p very large. 
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In Poo (w) 



•C In w. 



(35) 
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(36) 
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Fig. 4. The function H(r, g) for a population of constant size with (a) N = 1000 
and (b) N = 10000 when the number p of parents is 3. The generations shown are 
g = 3,5,7,9,11,12, and 13. 

4-1 p close to 1 



One can see from (23-25) that when p — > 1, the successive moments of the 
weight w diverge like (w n ) ~ (p — l) 1_n . This indicates that if one writes 

p = 1 + e (37) 
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the solution of the fixed point equation (26) can be expanded in the following 

way 

Q(A, oo) = 1 + eh (±\ + e 2 f 2 + e 3 /3 (±) + 6 4 /4 (fj + (38) 



where the functions /i,/ 2 ,... resum the most divergent terms in the pertur- 
bative expansion (27) in the range A = 0(e). If we insert the expansion (38) 
into (26) we get, by equating the two sides order by order in e, a hierarchy of 
differential equations for the functions f±, / 2 , ... which can be solved and lead 
to 



h(y) 
/ 2 (y) 

My) 



x 2 



+ 



In 



1 - 



y 



(39) 
(40) 



Uy 3 -3y 2 17y 2 -Qy , 
H : — In 



36 (1 - 1) 36 (1 - 1)' 



1 - ; 
y 2 + 2y 



36 (l - *y 



In 2 



(41) 



Comparing these expressions for large negative y with (34), one gets the ex- 
pansions of S, j3 

28 



1 - 2e + -e z - 



8= 1 e 

H 3 18 540 



+ o( £ <) 



3 • oW 



which both agree with what one would get by directly expanding (29,31). 
What the small e expansion gives us in addition is the function F p (z) which 
is found to be a constant function of z to all orders in powers of e, 

32 



F p 02)=4e 2 -- e 3 + 18e 4 + O(e 5 ) 



The non-constant nature of F p (z) does not show up in the expansion in powers 
of e. It is a non-perturbative contribution (which vanishes to all orders in 
e — p — 1) which could be calculated [12] using WKB-like techniques [13]. 

From (38-40) and the definition (22) one finds that, for small e, the continuous 
part of PooiyW) is an exponential 



l-2e + ^M 5(w) + 4e 2 e- 2ew . 
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Fig. 5. The fixed distribution Pqo(w) (the delta function contribution at w = is 
not shown) for p = 2, with N = 2 15 and g = 25; p = 3, N = 3 10 , g = 18; p = 4, 
AT = 4 8 , ^ = 14; p = 5, N = 5 6 , 5 = 11; p = 6, N = 6 6 , 3 = 11; and p = 7, N = 7 5 , 
g = 9. Averages over 1000 realizations have been carried out. The insert shows how 
the maximum w* varies with p. 

Corrections to this exponential shape are extractable from higher order terms 
(/2, /3, • • • )• 



4-2 large p 



The other case which can be dealt with perturbatively is the limit of large p. 
If p is large and A = Ofjo 1 / 2 ), the solution of (26) is given by 



l„Q(A,oo) = A + | + ^ 



+ 



A 5 



A^ _ 
2p 3 + 120> 4 



A^ _A*_ 
2p 2 24p 3 

+ o( P - 2 ) , 



(42) 



where each term represents a new order in powers of p 1 I 2 . This implies that 
P 00 (w) can be written in terms of x = w — 1 in the range x ~ p~ x l 2 as 



K ' V27T 



1 + 



par 



2 + hr 



px 4 7x 2 7 N 
IT + ~8~ ~ 12p, 
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+ 



p 3 x 9 
1296 



p 2 x 7 19px 5 



80 



95x 3 



x 

8p, 



+ 



(43) 



where each parenthesis represents a new order in p^ 1 / 2 . The Gaussian shape 
in (43) is not a surprise considering that, for large p, each weight becomes the 
sum of a large number of independent contributions. 

One property which can be extracted from (43) is the location of the maximum 

W* Of Poo(w) 



w = 1 - 

2p 24p 2 



+ 



p 3 



(44) 



Figure 5 shows the shapes (obtained by random samplings populations of 
constant sizes with p parents per individual) of the distribution P 00 (w) for 
several choices of p. The insert shows the values of w* extracted from these 
data. They agree with the prediction (44) that the maximum approaches 1 
with corrections of order 1/p as p becomes large. 



5 Conclusions 



We have seen that for simple neutral models of evolution with random mating, 
the distribution of ancestors repetitions in the genealogical tree of a present 
individual becomes stationary, with a fixed shape Pqo(w) which can be de- 
scribed by a fixed point equation of the type (26). This shape is the same if 
one considers a population increasing exponentially at rate p/2 per generation 
with two parents per individual or a population of constant size with p parents 
per individual. 

The fixed point equation (26) allows one to determine exactly the exponent 
(3 which characterizes P^w) at small w. The determination of [3 from (26) is 
very reminiscent of the way one finds exponents in the renormalization group 
approach of critical phenomena. Other properties (large w behavior, ampli- 
tude of the power law, . . . ) of the fixed distribution Poo(w) are in principle 
extractable from (26) but are more difficult to obtain than the exponent (3. 

The present work admits several extensions. In particular, one may consider 
the case where the probabilities q k (that an individual has k children) is arbi- 
trary (instead of Poissonian as in (14)). The fixed point equation (26) becomes 
then simply 

Q(\, oo) = V q k Q { -, oo ) 
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and starting from this new fixed point equation, one can essentially repeat all 
the above calculations, including the determination of the exponent (3. If all 
the qk vanish for k > /c max , one can see that for large A, 

lnQ(A,oo) ~ A lnfcm -/ lnp . 

Consequently, the distribution Poo(w) becomes a stretched exponential for 
large w, 

In Poo (w) ~ _ W lnfe m ax/ln(fe max /p)_ 

Recursions similar to (11) describe the distribution of constraints in granular 
media [14]. In such cases, the number of grains in direct contact and supporting 
the weight of a given grain is variable. This would correspond to considering 
that the number p' of parents is no longer constant over the whole population 
but may vary from individual to individual. 

Finally let us mention that an interesting aspect of the problem is the cal- 
culation of the correlations between the genealogies of several contemporary 
individuals. One can show [15] that for large g, the weights of all the ancestors 
of two distinct individuals in the same population become the same after a 
number of generations g c oc In N. 



6 Appendix: The correlations of the weights 

In this appendix we show, by calculating moments of the weights Wj a \g), that 
correlations become negligible in the r.h.s. of (11) and (13). 

6.1 The case of a varying population size with 2 parents per individual 

It is convenient to rewrite (11) as 

»! tt) (3+l) = -Ex(M>S a, ( 5 ) (45) 

P 3=1 

where 

{0 if % is not a parent of j 
1 if % is one of the two parents of j . (46) 
2 if % is the two parents of j 
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For the random parent model of section 2 (where each parent of j is chosen at 
random among all the individuals of the previous generation), x(hj) = with 
probability (1 — 1/N g+1 ) 2 , x(hj) — 1 with probability 2(1 — 1/N g+1 )/N g+1 and 
x(hj) — 2 with probability 1/N 2 +1 (as we did not exclude choosing the same 
parent twice). Moreover there is no correlation between x(hj) an d x{i' ■>?) if 
j 7^ j'. Lastly x(hj) an d x{i' ■> j) are correlated for i ^ i' and 

(x(Mk(i^> = t|- ■ (47) 



This correlation together with 

(X(ij)) = ^- (48) 

(X(ij) 2 ) = ^- + j£- (49) 

<x(i,i)x(«',/)> = -^- for j^f (50) 
when used in (45) leads to 

(wiig + l)) = ( Wi (g)) 
as expected, since the definition (6) of w was chosen to keep (w) = 1, and 



{a ' 19 + 1)2) = (I + i*b) (tt, - (9)2> + f 1 - idfc) {w ' i9)w " ig)) (51) 

{»,(;, + 1)10,(9 + 1)> = -JL-^) 2 ) + h - -JL-J {^(sluvlj)) (52) 

where i ^ i' (the index ( a ) has been omitted for simplicity). 

From (1,2,6), we know that Y,i w i(.9) — N g , and (wi(g)) = 1. Thus for i 7^ «' 

(^(S)"'*'!?)) = ; (53) 



and (51) becomes 
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So far this evolution equation is exact. 
If we consider that all the iV 3 's are very large, (54) becomes 

{w i ( g + i) 2 ) = ±{w i ( g ) 2 ) + i, 

so that for large g (in fact g should not be too large to keep N g large enough, 
more precisely g should be such that (p/2) 9 <C iVo <C p 9 ), the second moment 
of w has a limiting value (wi(g) 2 ) — > and we see from (53) that 

( Wl (g)w t/ (g)} - 1 = (w) 2 . (56) 

When one repeats the above calculation for higher correlations (we did it up 
to three-point correlations), one finds that the correlations between the terms 
in the r.h.s. of (45) are negligible. This indicates that these correlations can be 
neglected (of course a complete proof that all correlations are negligible in the 
scaling limit would be much better than our guess based on the computation 
of the lowest correlations). 

One can repeat the above calculation of correlations for several variants of 
the model, like those discussed at the end of section 2. The exact formulae 
(51,52,54) are modified but one always find that, in the scaling regime, they 
reduce to (55,56), meaning that the correlations could be ignored. 

6.2 The case of a population of constant size with p' parents per individual 

Let us consider only the case where each individual has p' parents. To keep 
the notations simple, we will limit the calculation to the case of a population 
of constant size 

N g = N 

One can then follow the same steps as above. Starting from (13), one replaces 
(45) by 

4 Q) (5+l) = iEx(^>! a) (9). (57) 
V j= i 

The correlations (47-50) become in this case 



N n 



(54) 



(55) 
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p'ip' — 1) 

W 2 



for i 7^ i' 



(58) 



(59) 



pf_ p'(p' - 1) 
AT N 2 



(60) 



(x(iJ)x(i' J')) 




for j 7^ / 



(61) 



and (51,52) read 




p'N 





{wi(g)wi>(g)). 



{wi(g)wi>(g)) 



(62) 



(63) 



For large g and large A, we see (using the fact that J2i w i(.9) — N) that 
(wi(g) 2 ) — > p'/(p' — 1) and (wi(g)wi>(g)) — >• 1 as p — >• oo. This again indicates 
that correlations can be neglected for large g and large AT. 
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